2 


a 


Predictions of nuclear charge radii based on the convolutional neural network* 


Ying-Yu Cao,! Jian-You Guo,? and Bo Zhou! t 


! Key Laboratory of Nuclear Physics and Ion-beam Application (MOE), 
Institute of Modern Physics, Fudan University, Shanghai 200433, China 
> School of Physics and Optoelectronics Engineering, Anhui University, Hefei 230601, China 
Shanghai Research Center for Theoretical Nuclear Physics, 
NSFC and Fudan University, Shanghai 200438, China 


In this study, we developed a neural network that incorporates a fully connected layer with a convolutional 
layer to predict the nuclear charge radii based on the relationships between four local nuclear charge radii. The 
convolutional neural network (CNN) combines the isospin and pairing effects to describe the charge radii of 
nuclei with A > 39 and Z > 20. The developed neural network achieved a root-mean-square (RMS) deviation 
of 0.0195 fm for a dataset with 928 nuclei. Specifically, the CNN reproduced the trend of the inverted parabolic 
behavior and odd—even staggering observed in the calcium isotopic chain, demonstrating reliable predictive 


capability. 
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I. INTRODUCTION 


Nuclear charge radius is a fundamental property of the nu- 
cleus, along with its mass, density, spin, and moments [1- 
3]. This reveals vital information regarding the nuclear struc- 
ture, such as shell closure, halo effect, and electromagnetic 
properties [4-6]. To date, various experimental techniques 
have been employed to measure the nuclear radius, includ- 
ing the scattering of high-energy particles in the nucleus [7], 
atomic spectroscopy [8, 9], X-ray spectroscopy excited by 
p atoms [10, 11], and isotopic shifts in the ultrafine struc- 
ture of the atomic spectra [12]. The CR2013 database con- 
tains the root-mean-square (RMS) nuclear charge radii for 
909 isotopes of 92 elements, from hydrogen to curium [13], 
with neutron numbers ranging from 0 to 152. More recently, 
the measurement of charge radii have been extended to nuclei 
much beyond the {-stability line. The latest CR2021 database 
includes data for 129 charge radii of nuclei, with A > 21 and 
Z > 14[14. 

Numerous theoretical methods have been developed to 
describe and predict charge radii, which can be broadly 
classified into three major categories: macroscopic, micro- 
scopic, and traditionally macroscopic. Microscopic models 
are based on effective and real interactions to study the struc- 
ture and properties of nuclei. Microscopic models include the 
Hartree—Fock—Bogoliubov (HFB) model, relativistic mean- 
field (RMF) theory, ab initio calculations, and the cluster 
model [15-19]. For instance, the HFB21 model achieved a 
high level of accuracy for predicting the charge radii, with 
an RMS deviation of 0.027 fm for 782 measured charge 
radii [16]. Traditional macroscopic models rely on phe- 
nomenological laws to describe charge radii, including the 
A‘? and Z!/3 laws [21-25], where A represents the mass 
number and Z denotes the number of protons. In addition, 
the nuclear charge radii can be accurately predicted using 
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the Garvey—Kelson relationship. As such, these relationships 
describe the nuclear charge radii based on the relative posi- 
tion of a given nucleus with respect to other known charge 
radii [26-28]. Although phenomenological models can ac- 
curately describe the global trends in nuclear charge radii, 
they often struggle to reveal the underlying physical quan- 
tities within the individual nucleus. 


Machine learning is widely applied in physics owing to 
the availability of large datasets and powerful hardware tech- 
nologies. It has proven to be highly effective in process- 
ing complex data, uncovering hidden patterns, and solving 
stochastic processes. In fields such as high-energy physics 
and astrophysics that requires processing enormous amounts 
of data, machine learning is essential for analyzing the evolu- 
tion and morphology of galaxies and extracting information 
from high-energy reactions [29-36]. In nuclear physics, ma- 
chine learning techniques such as artificial neural networks 
(ANNs) [37], Bayesian neural networks (BNNs) [39-44], and 
radial basis functions (RBFs) have been used to improve the 
accuracy of predictions of nuclear properties such as nuclear 
mass, charge radius, and (-decay half-lives [45-52]. Fur- 
thermore, convolutional neural networks (CNNs) have been 
used to solve problems pertaining to density functional the- 
ory [54, 55]. Currently, fully connected neural networks are 
generally employed to describe the properties of nuclei in nu- 
clear physics, with several weights and high computational 
complexities. To mitigate this complexity, we considered a 
CNN with local connections and weight sharing. 


In this paper, we present a novel approach for improving 
the prediction of nuclear charge radii within local charge ra- 
dius (CR) relations by utilizing a CNN. In Section 2, we pro- 
vide a brief overview of the theoretical background of local 
CR relations and the principles of CNN. In Section 3, we de- 
scribe the proposed methodology for training a CNN using 
existing experimental data on nuclear radii. We present the 
results of our analysis, including the global RMS deviation 
obtained by the proposed model, and provide relevant dis- 
cussions and predictions based on the findings. Finally, the 
present findings and the conclusions of this study are summa- 
rized, highlighting the potential of CNN as powerful tools for 
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improving the prediction of nuclear charge radii. 


I. THEORETICAL FORMALISM 


The Garvey—Kelson relationship was introduced in 1966 as 
a method for predicting the mass of nuclei. Owing to the com- 
plexity of the Hamiltonian for a multibody system, the mass 
of a nucleus cannot be directly calculated from the first prin- 
ciples. Instead, the mass of a given nucleus can be accurately 
inferred from the masses of its neighboring nuclei [57, 58]. 
Similarly, four local CR relations were proposed [27]: which 
are based on the charge radii of the four neighboring nuclei 
and are referred to as the 1n — 1p, In — 2p, 2n — 1p, and 
2n — 2p CR relations, respectively. 


ORin—jp = 0, i,j = 1,2, 
i= RN ZS RO) 
-R(N-i,Z)—- R(N,Z- j) @ 


As reported in Ref. [27], the RMS deviation was 0.0078 fm 
for all nuclei with N, Z > 2. However, the Garvey—Kelson 
relationships are expressed as local formulas. The radii of the 
neighboring nuclei must be known to obtain the radius of a 
specific nucleus. Inspired by Refs. [27, 28], we aimed to uti- 
lize local CR relationships to characterize the global nuclear 
radius. 

When i = 1 and j = 1, dRin—1p represents the charge 
radii relationship between the four nearest neighbor nuclei, 
and the following four subrelations can be derived similarly: 


5Rin—1p = R(N, Z) + R(N —1,Z — 1) (2) 
=R(N —1,2) — R(N,Z = 1), 

5Rin-tp = R(N +1,Z +1) + R(N, Z) (3) 
-R(N,Z +1)— R(N +1,2), 

5Rin-1p = R(N +1, Z) + R(N,Z — 1) (4) 
—R(N,Z)— R(N +1,Z — 1), 

5Rin-1p = R(N, Z +1) + R(N — 1, Z) (5) 
-R(N —1,Z +1) = R(N, Z). 


Therefore, for 7,7 = 1, 2 , we can utilize 16 subrelations to 
calculate R(N, Z). Subsequently, these 16 subrelations are 
added to obtain a radius relation for the 25 nuclei, and 25 
nuclear charge radii were reshaped into a 5x5 matrix with 
the corresponding coefficients. This local relationship is dis- 
played in Fig. 1 (a). Similarly, we considered the local re- 
lation of the nine nuclear charge radii, as depicted in Fig. 1 
(b). 

CNNs are commonly applied in the field of computer vi- 
sion [59]. They can identify local features in images using 
fewer parameters and effectively reduce overfitting. By ar- 
ranging the charge radii from CR2013 for nuclei with Z > 
18 (Fig. 1), a 79 x 153 nuclear charge radius matrix was 
formed. The coefficient matrix in Fig. 1 is treated as a con- 
volutional kernel in CNNs. The sliding of the convolutional 
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kernel across the matrix forms a convolution process that is 
crucial for CNNs. The two-dimensional convolution formula 


is stated as follows: 
> da gli, j)h 


where O(u, v) denotes the value of the element at coordinates 
(u, v) in the output matrix and g(i, 7) denotes the value of the 
element at coordinates (i, j) in the input matrix. Similarly, 
h(u — i, v — j) represents the value of the element at the co- 
ordinates (u — i, v — j) in the convolution kernel. 

The empirical formula developed by Nerlo—Pomorska and 
Pomorski [23]: 


Í), (6) 


(u— iv — 


b 41/3 
7^4 bi 
where 79 = 0.9561 fm, a = 0.1426, and b = 2.1057. This 
three-parameter formula improves the description of the ra- 
dius of even—even nuclei, and the RMS deviation is 0.0385 
fm for nuclei with A > 39 and Z > 20, which were used as 
the training set. Thus, odd-even staggering crucially affects 
several isotopes. Therefore, by introducing a ô term into Eq. 
(7), we can obtain: 


R.(Z, N, A) = roll 


(7) 


N-Z b ô 
a a tata 
where ro =0.9562 fm, a = 0.1429, b = 2.0825, and c = 
0.0694. The parameters were obtained by fitting the train- 
ing set and the RMS deviation was 0.0385 fm. Similarly, we 
constructed a pairing term for 6 local relations to describe the 
odd-—even staggering effect of the charge radius for the nuclei 
portrayed in Fig. 1 (c). Considering the characteristics of the 
nuclei near the 8-stability line, we added the isospin depen- 
dence T, which is also referred to as the NP relation [44]. 6 
and T, defined as follows: 


Re(Z, N, A) = roll Jae, 


(8) 


(9) 


The structure of the neural network is depicted in Fig. 2. 
The neural network comprised an input layer, a fully con- 
nected layer, a convolutional layer, and two output layers. 
The input data are the proton number Z, neutron number N, 
and mass number A of 928 nuclei. The activation function 
from the input layer to the fully connected layer is Mish(x), 
and its formula is expressed as follows: 

Mish(x) = z - tanh(In(1 + e”)). (10) 
denotes the output of a fully connected layer. These three 
layers formed a fully connected (FC) net. We obtained a CR 
matrix with 79 rows and 153 columns using FC net. We then 
considered the CR matrix as the input to the convolutional 
layer. As depicted in Fig. 2, the convolutional layer com- 
prises four components: convolution of the nuclear radius 
matrix with 5x5, 3x3 CR relations, 5x5 isospin dependence 
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Fig. 1. (Color online) (a) radii relations between 25 neighboring nuclei, visualized as a matrix formed by rearranging the 25 nuclei and 
convolving it with the coefficient matrix. (b) radii relations between 9 neighboring nuclei. (c) odd—even staggering effect across 25 nuclei. 
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Fig. 2. (Color online) Neural network structure used herein. Input data comprises proton number Z, mass number A, and neutron number 
N. Neural network has two hidden layers with eight and four neurons, respectively. The hidden layer processes the data to output numerous 
nuclear charge radii. The data were reshaped into a matrix and fed into the convolutional layer for further processing. After the convolutional 
layer has processed the data, the final output was obtained. The stride was 1 for the convolutional layer. 


(NP relations), and 5x5 delta relations. Thereafter, we as- 
signed each component a weight w; and added it to the final 
output of the neural networks. The calculation process for the 
convolutional layer can be expressed as 


4 
f(O) = $ | R(wi04) +b, (11) 


where R(x) represents the activation function and O; repre- 
sents the convolutional result, as depicted in Eq. (6), and b de- 
notes the bias. w and b indicate the training parameters of the 
neural network. Finally, we obtain an output convolutional 
neural network. The form of the loss function is consistent 
with the RMS deviation (ø). 


Pa 1/2 
Loss = 5 (REP: — ROM)? /m : 


i=1 


(12) 


where, RỌ®!- and REP- denote the neural network predic- 
tions and experimental values of the dataset, respectively, and 
m indicates the number of dataset. 


Table 1. RMS deviation of training set, validation set, and entire 
set for Eq. (8) and the proposed neural network (CNN). 


Training set Validation set Entire set 
gEa (8) 0.0384 0.0332 0.0378 
oONN 0.0201 0.0156 0.0195 
gE4.(8)_ CNN 
Rais) 0.48 0.53 0.48 
183 Ill. RESULTS AND DISCUSSIONS 


Based on local CR relations, we propose a neural network 
185 Containing convolutional layers to study nuclear charge radii. 
186 We aimed to explore whether local CR relations could de- 
187 Scribe the global nuclear charge radii and their ability to pre- 
188 dict the charge radii of unknown nuclei. 

189 In this study, datasets were obtained from CR2013 and 
190 CR2021 [13, 14]. To test the predictive ability of the neu- 
191 ral networks, we selected 814 charge radii data points with 
192 Z > 20 from CR2013 as the training set and 114 charge radii 
193 data points from CR2021 as the validation set. The entire set 
194 included the training and validation sets, which contained 928 
195 nuclear-charge radii. 
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Fig. 3. (Color online) Charge radii residuals DR = REP’ — RE* 
(b). 


Similar to alternative neural networks [37, 38, 44], we con- 
sidered the proton number Z, neutron number N, and mass 
number A as inputs for the neural network. Owing to the lo- 
cal connection property of the convolutional layer, we must 
generate the nuclear charge radius data through the fully con- 
nected layer and arrange them into a matrix of 79 rows and 
153 columns. Here, 79 represents the number of protons, 
ranging from 18 to 96, and 153 represents the number of neu- 
trons, ranging from 0 to 152. When the loss value of the val- 
idation set diminishes to minimum, the generalization of the 
neural network is optimal. Therefore, we compute the RMS 
deviation of the entire set for the CNN. The rootaATmean-— 
square (RMS) deviations for Eq. (8) and CNN for the train- 
ing, validation, and entire sets are presented in Table 1. The 
CNN yielded an RMS deviation of 0.0195 fm for the entire 
dataset and 0.0156 fm for the validation dataset. Compared 
to Eq. (8), the entire set is improved by 48% and the accuracy 
of the validation set significantly improved by 53%. 

Subsequently, we provide the radius residuals ôR between 
the experimental Ree: and the CNN predictions Roa yie, 


ÔR = RPP — By (13) 


The radius residuals ôR of 928 nuclei with proton number 
Z are denoted in Fig. 3. As observed, |R| < 0.04 fm for the 
validation set. | R| > 0.06 fm for certain heavy nuclei in the 
training set, especially near the isotopes of Z = 65 and 80. In 
Ref. [61], Figure 1 displays the significant radii differences 
for isotopes near N = 60, 90, and Z = 80. Near the N = 
60 and 90 regions, the enlarged amplitudes of the radii differ- 
ences are attributed to the heavy nucleus with a strong static 
deformation [62—64]. The abnormal radius difference in the 
Z = 80 region is owing to shape staggering, indicating that 
the shape of the heavy nuclei changes from prolate to oblate, 
and thereafter, back to prolate [61, 65, 66]. 

The RMS radii for calcium isotopes from the CNN, Eq. (7), 
Eq. (8), and experimental data are comparatively presented in 
Fig. 4 (a). The experimental data revealed a significant odd- 
even staggering in the charge radius of the calcium isotopes 
between N = 20 and N = 28, with a pronounced shell ef- 
fect at N = 28. Although Eq. (8) contains the 6 term, it fails 
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` between experimental data and CNN of training set (a) and validation set 


to provide a satisfactory description of the experimental data. 
In addition, the amplitudes of the odd—even staggering calcu- 
lated by the CNN were smaller than those of the experimen- 
tal data. Furthermore, CNN calculations indicate an inverted 
parabolic trend between N = 20 and N = 28 along the cal- 
cium isotopic chain [5, 67, 68]. The current predictions for 
the radii of 3849-51Ca isotopes yielded results were aligned 
with the available experimental data. The results indicate that 
the CNN is reliable and provides predictive ability. However, 
the shell kinks at N = 20 and N = 28, as demonstrated 
by the CNN, were insignificant. Certain studies have em- 
ployed the macroscopic-microscopic method, incorporating 
shell corrections and quadrupole and hexadecapole deforma- 
tions to achieve shell kinks of calcium isotopic chains [5, 67]. 
As depicted in Fig. 4 (b), CNN fails to accurately reproduce 
the prominent shell kinks of zirconium isotopes at N = 50, 
suggesting that accounting for the shell effect may improve 
the description of nuclear charge radii. 


In Ref. [25], a five-parameter formula was proposed that 
included many physical features such as the shell effect and 
odd-even staggering. The RMS deviation of the charge ra- 
dius for nuclei with A > 40 was 0.023 fm. Recently, 
Ref. [44] combined the pairing and shell effects using a three- 
parameter formula and BNN to describe the nuclear charge 
radius. The RMS deviation is 0.015 fm for the entire set. In 
Ref. [53], the inputs of BNN with four engineered features, 
namely, the pairing effect, shell effect, isospin effect, and 
"abnormal" shape staggering effect. The proposed method 
obtained an RMS deviation of 0.014 fm for the training and 
validation datasets. The BNN outperformed the CNN for de- 
scribing the trend of the nuclear charge radius, indicating that 
the shell and shape staggering effects can considerably de- 
scribe the trend of the nuclear charge radius [5, 60]. 


The difference between the theoretical results of the CNN 
and the experimental data for all the CR in the entire set is 
shown in Fig. 5. As observed, the residual of the charge 
radius of the nucleus located near the shell is less than 0.04 
fm, implying that the neural network combined with local CR 
relations can accurately describe the nuclear charge radius. 
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Fig. 4. (Color online) Charge radii of (a) calcium and (b) zirconium isotopes predicted by Eq. (7), Eq. (8), and CNN, compared with the 
experimental data. 


Fig. 5. (Color online) Differences between predicted and experimental values of charge radii for 928 nuclei. 


274 IV. SUMMARY 25 There are limitations to the application of convolutional 
236 neural networks in terms of the charge radii of the nuclei. 
27 Existing experimental data on the nuclear charge radius are 
28 limited and scattered. Therefore, we generated data in fully 

27s We developed a neural network with a convolutional layer 289 connected layers, such that the regular convolutional kernel in 

27 to analyze the nuclear charge radii of 928 nuclei. We in- 290 the convolutional layer acted on the data. Presently, the graph 

277 corporated the isospin-dependence T and pairing relations ô 2% neural networks and deformable convolutional kernels have 

27a into the convolutional layer to account for the physical factors 292 offered significant advantages in processing irregular data. 

279 that influence the nuclear charge radius. The network model 293 Perhaps the graph convolutional neural networks use GCNs 

230 achieved an RMS deviation of 0.0201 fm for nuclei with A > 24 to overcome the limitations of the nuclear charge radius data. 

231 39 and Z > 20 in the CR2013 database, and 0.0156 fm for 

2322 CR2021 as the validation set. The proposed neural network 
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